Phase Mapping as a Powerful Diagnostic of Primordial Non-Gaussianity 
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The identification and extraction of non-Gaussian signals is one of the main cosmological challenges 
facing future experimental measurements of the cosmic microwave background temperature pattern. 
We present a generalized statistical measure based on a novel technique representation of Fourier 
phases using the return map. We show that this method is both robust and powerful in comparison, 
for example, with morphological measures. 



I. INTRODUCTION 

One of the key challenges facing modern cosmology is 
the complete statistical characterization of the primordial 
density fluctuations believed to be the seeds of the large- 
scale structure of the Universe we see today. According to 
the prevailing orthodoxy, these initial perturbations were 
generated as quantum phenomena during an inflationary 
epoch Q. If this is the case they should display Gaus- 
sian statistics. Among other potentially testable conse- 
quences of this is that the small angular variations of the 
cosmic microwave background (CMB) sky should have 
the same statistical form [0||. Testing the hypothesis 
of primordial Gaussianity using maps of the CMB thus 
provides an opportunity to make experimental tests of 
ideas about the physics of the early universe. It is also 
necessary to devise robust statistical descriptors for the 
identification of foreground contaminations and system- 
atic instrumental artifacts. 

Searches for non-Gaussian (NG) signals within the 
COBE-DMR data have yielded positive detections that 
are at least partly explained by systematics . The need 
to devise more powerful statistical probes has been rec- 
ognized and acted upon, with key ideas focusing on the 
bispectrum || and higher-order polyspectra Some 
of these methods have been developed as far as practical 
implementation on real data sets ||]. These techniques 
are sensitive only to particular types of NG signal 0, 
so in this paper we present a new method that is both 
extremely general and extremely robust. 

II. PHASE MAPPING 

One of the most basic properties of a Gaussian random 
field (GRF) is that the Fourier phases are random 
and uniformly distributed between and 2tt. The Central 
Limit Theorem virtually guarantees that a superposition 
of a large number of Fourier modes with random phases 
will be Gaussian, so the random-phase hypothesis even 



serves as a definition of a form of Gaussianity. 

Testing the randomness of measured phases therefore 
provides a direct diagnostic of the statistical form of a 
fluctuation field. There are, however, some problems in 
using phases as a statistic. The foremost is that they 
are circular variables, only defined modulo 2tt. Tradi- 
tional measures of association, such as covariances of the 
form (4>f s 4 > h') > are based on the assumption that the re- 
lated measure is linear and are therefore inappropriate. 
Another problem is that phases are direct indicators of 
the morphology and location of specific features p0| , so 
are themselves not translational- invariant. As the first 
attempt to use phases as the basis of a statistical test, 
ref. (l^] used a quantity formed by taking the phase dif- 
ferences of neighboring Fourier modes in fc-space. This 
idea is the first step towards our new "phase-mapping" 
technique |l5j . We counter the problem of the circular 
nature of the phases by applying phases on to a "return 
map" , as follows. 

For a set of phases <pk from the Fourier transform of a 
one-dimensional process, a possible return map is a plot 
of (f>k against <f>k+i (neighboring phases) for all available 
pairs (4>k, 4>k+i) up to the Nyquist frequency k^/ 2 where 
N is the length of the process [|o). When the phases 
are random this will be a scatter plot with points dis- 
tributed randomly within the bounded square of both 
axes [0, 2tt]. For a realization of two or more dimensions, 
we can extend the return mapping between neighboring 
phases to any fixed scale Afc = (Ak x , Ak y ) = (m,n) (so 
mapping of neighboring phases is simply the mapping for 
|Afc| = 1). Therefore, this method forces to test random- 
ness between all pairs of (j)]^ and ^fc+Afc' ^ or au nxe d Afc 
scales in the Fourier domain. What results is a "super- 
map" , each pixel of which represents a return map with 
a fixed scale Afc. 

III. STATISTICAL ANALYSIS OF PHASE MAPS 

Having rendered the phase information in a pattern 
onto the phase map, we have to consider the construc- 
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tion of a statistical test using it. Our null hypothesis at 
each Afc case is a random (Poisson) distribution on the 
mapping square, so the randomness of phases is tested 
through 2 x m max x n max squares, where m max and n max 
are the maximal scale in k x and k y direction, respectively. 
For simplicity and computational ease [ p"3| , we apply a 
test (similar to the x 2 test) of the uniformity of the pix- 
elized mapped phases for different scales Afc as follows. 
A mean x 2 statistic is defined as 



X 2 = 



-y 



[p(i,j)-p] 



= ° 2 P, 



(1) 



where M is the number of pixels on the return 
map, p is the mean value for each pixel, and a 2 — 
■ \p(i,j)/p— l] 2 is the variance of the contrast 
of the return map. 

The return mapping of random phases will result in 
a Poisson distribution over all the map squares. For 
a Poisson distribution of integer p{i,j), its variance 
((Ap) 2 ) Poisso ^= £~ (p - p) 2 (pPe-P/p\) = p. Hence 

in this case X 2 Poisson = 1, and from Eq. a 2 oiBSOn = 
(p) ■ One could use this result to test the return map 
for consistency with pure discreteness fluctuations. How- 
ever, such a test is not so useful as it does not probe the 
spatial arrangement of the pattern. The individual re- 
turn maps tend to display particular patterns to which 
this would not be sensitive. We could, for example, pop- 
ulate the pixels with a Poisson distribution of counts 
but then rearrange them arbitrarily over the grid with- 
out changing the statistical properties mentioned above. 
One could sort the counts in ascending order, for exam- 
ple, producing a clearly non-uniform distribution that 
has the same statistics as the starting case. 

In order to apply this method more usefully we have to 
take into account the spatial correlations, which we can 
do using smoothing. For a 2D square p(i,j) = p{x), we 
use the form 



p(x,R) = 



d 2 x 
2^R2 



p(x ) exp — 



\x — x 
2R 2 



(2) 



where p(x, R) is the smoothed square and R is the 
smoothing scale. 

To obtain the expectation value of x 2 statistic of GRFs, 
we calculate first the variance of the contrast of the 
smoothed square, using a Gaussian window filter as in 
Eq. (]|). For a 2D field with power spectrum P(k) = Ak n , 
the variance of the contrast of the smoothed square ||| 
is a 2 {R) = (A/4ir)T(n + 2/2)R-^ l+2 \ where T(n) is the 
Euler Gamma function and R is the smoothing scale in 
Eq. (Q). A Poisson distribution has a power spectrum 
which is independent of wave number, i.e., its spectral 
index n — 0. So originally the variance of the con- 
trast Op oisson = A = (p) _1 , after smoothing, becomes 
^Poisson^) = (47ri? 2 p)- 1 . According to Eq. (Eh, we can 



obtain the (x 2 ) from the variance of the smoothed return 
map and the p, so for an ensemble of Poisson distribu- 
tions, the expectation mean x 2 



<X 2 ) 



4nR 2 ' 



(3) 



Thus, any smoothed return map of phase pairs at a 
fixed Afc that generates a x 2 value much higher than 
(AttR 2 )^ 1 should be interpreted as a diagnostic of phase 
coupling at that particular scale in phase space. To ap- 
ply phase mapping technique as a NG test, the null hy- 
pothesis requires that phase mapping for any fixed scale 
Afc = (m, n) should result in Poisson distribution in a re- 
turn map. Therefore, for a Gaussian realization, we have 
an ensemble of 2 x m max x n max smoothed return squares 
which are subject to statistical fluctuations around (x 2 )- 
Although phase mapping extracts all the information be- 
tween phase pairs, we have to know whether the value 
obtained from x 2 statistic is due to non-randomness or 
statistical fluctuation. As the x 2 statistic is related to 
the 'variance' of the smoothed Poisson-distributed re- 
turn map, the problem now is equivalent to finding the 
distribution of sampling variance from normal samples. 
The statistical significance of the departure of a mea- 
sured value of x 2 can be assessed using standard tech- 
niques for the sampling distribution of the variance in 
normal samples. When the sampling number is large 
enough, it behaves as Gaussian. We have M indepen- 
dent variables pi — pj ^/p, each of which is distributed 
normally with zero mean and variance (x 2 )- We are 
now looking for the distribution of sampling variance 
w = x 2 = YliiPi ~ P) 2 ' IPi which can be obtained 

as H, 



(M/2(x 2 ))^ 2 

- r[(M - 1)/2] e p 
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Therefore, the variance of the sampling x 2 measurements 
(M » 1) is E 2 = /;> - (x^)) 2 dF = (F) 2 /(Af/2). 
The dependence of E 2 on R can be obtained through 
Eq. (^), and the change of the effective pixel number M 
to (7r/2) 2 M/(47ri? 2 ) due to smoothing. Therefore, 



E 2 = 



f 



7T 3 # 2 (M/2)' 



(•5) 



We can examine NG signals in two ways. A pixel in 
the "super-map" which corresponds to vY* with v ~ 5, 
say, would be extremely unlikely if the null hypothesis 
held. As well as easily identifying such hotter pixels, we 
can also retrieve the corresponding Afc scale of the signal 
from the map. One could look at modes flagged in this 
way more carefully using other statistics if necessary. On 
the other hand, we can also inspect the global property of 
the distribution curve with (x 2 ) and E when the number 
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of 2 x m max x n max is large. Even when all signals are 
below 5E, the underlying obscure NG part could cause 
the distribution curve to skew, albeit no signal is above 
5S due to possible 'Gaussianization' by the noise. 



IV. APPLICATION TO SIMULATED CMB MAPS 

We apply this method to a simulated NG CMB map 
(Fig. 1(a)), which is a realization of 12.8 deg 2 CMB 
anisotropies due to the Kaiser-Stebbins effect from cos- 
mic strings Oj. We will test the non-Gaussianity by 
applying our method not only directly to the original 
manifestly NG map (S), but also to the combined map 
of itself plus Gaussian white noise (N) with 5 different 
fluctuation levels, as NG part may be embedded in white 
noise. The Gaussian white noise level are chosen with 
rms ratio S/N = 8, 4, 2, 1 and 1/2. 
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FIG. 1. The temperature map (flk) and the "super-map" 
in which each point represents the x 2 value for that (m, n 
scale. The histogram of the latter is shown in Fig. ^ (a). 



From the Fourier transform of a 7V 2 -mesh realiza- 
tion, we have N 2 /2 valid modes: k x S [-N/2 + I, N/2] 
and k y E [1,7V/ 2]. In general, the phase basis used 
to extract NG signals is flexible with the constraint 
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We have taken phases 



from the inner quarter of valid modes as a basis, i.e., 
k x € [-N/4+ I, N/4], and k y G [l,JV/4]. As the Gaus- 
sian noise normally dominates the power spectrum at 
large fe's, we can opt to avoid that part of 'Gaussianized' 
phases by shrinking the upper limit |fc| max of the phase 
base (not the upper limit of (m,n)). 
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FIG. 2. The histogram of x 2 statistics. Panel (a) is 
from the pure temperature map caused by cosmic strings of 
Kaiser-Stebbins effect. For comparison, the gray curve is from 
a realization of (random-phase) GRF. Panel (b), (c), (d), (e) 
and (f) are the histograms for the combined map of CMB 
plus Gaussian white noise with rms ratio S/N = 8, 4, 2, 1 
and 1/2, respectively. The upper horizontal axis is annotated 
in terms of the theoretical value E of GRFs with origin set 
at 6?) = (47T.R 2 )" 1 . The smoothing scale on the M = 128 2 
pixelized return map is R = 2. Therefore we regard those 
above 5E as against the null hypothesis, thus manifestation 
of phase coupling. We can see that phase mapping is able to 
detect considerable signals above 5E for the added Gaussian 
noise S/N = 4 and some for S/N = 2. Note that albeit no 
signals above 5E in (e), the median is shifted away from the 
(x 2 ) value, which is also NG manifestation. 

In Fig. H we show the histogram of the measured x 2 
statistics. Those above 5S are caused by phase coupling 
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of some Afc, which can be viewed in the (m, n)-map as 
in Fig.l (b). As a NG test, we are interested only in the 
amplitude of phase coupling in terms of x 2 > whatever the 
Afc. 

V. COMPARISON TO THE ANALYSIS OF 
MINKOWSKI FUNCTIONALS 

We now compare this method with a more standard 
morphological analysis of temperature contours, based on 
Minkowski functionals (MFs). The 3 MFs in 2D are the 
area (Vo), circumference (Vi), and the Euler characteris- 
tic (V2) of excursion regions The excursion region 
is the region on the sky map above a certain threshold 
amplitude fig] . In Fig. || we show the 3 MFs for the 
noise-contaminated cases: CMB plus noise with S/N = 4 
and 2 shown by solid curves. The shaded areas are from 
10 3 realizations of the same power spectrum of the cor- 
responding noise-contaminated CMB map but with ran- 
dom phases, which thus represents the statistical fluctu- 
ations of GRFs. We avoid quoting la around the mean 
MFs as the distribution function for each threshold is not 
normal (e.g. for high thresholds v, V2 oc v e~ v 1 2 
The case of S/N = 4 is out of the shaded area, a definite 
NG manifestation, but S/N = 2 is right on the edge. The 
dashed curves are taken from one Gaussian realization in 
order to show the variation of the MFs and the ambiguity 
that would result. 




FIG. 3. Minkowski functionals for the combined tempera- 
ture map with S/N = 4 (left column) and S/N = 2 (right) 
in solid curves against threshold (in terms of rms). From top 
downwards are the area, circumference and the Euler char- 
acteristic. The shaded areas are the possible ranges for each 
functional from 10 3 realizations of the same power spectrum 
(as the noise-added CMB map) but random phases and the 
dash curves are from one of the 10 3 Gaussian realizations. 



Other types of NG signals localized in harmonic space, 
e.g., artifacts from systematics such as slowly-rotating 
elliptical beam of the PLANCK mission will produce NG 
signals around £ p i x , which is more easily detected with 
harmonic analyses than real-space ones. By "localized 
in harmonic space" we mean the phases correlate mainly 
from large k. The contrary example is the simulations of 
spectral index n — 2 where phases correlate first at 
large Afc from all k. 

These results show that the phase-mapping technique 
is able to detect certain forms of non-Gaussianity even 
in the presence of noise, more effectively than the MF 
approach. The theoretical ground for phase mapping we 
have developed as a powerful NG diagnostic can be di- 
rectly applied not only on spherical analysis, but also on 
time-ordered scans of the upcoming planck mission. 
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